Speech enhancement using 2-D Fourier transform

نویسندگان

  • Ing Yann Soon
  • Soo Ngee Koh
چکیده

This paper presents an innovative way of using the two-dimensional (2-D) Fourier transform for speech enhancement. The blocking and windowing of the speech data for the 2-D Fourier transform are explained in detail. Several techniques of filtering in the 2-D Fourier transform domain are also proposed. They include magnitude spectral subtraction, 2-D Wiener filtering as well as a hybrid filter which effectively combines the one-dimensional (1-D) Wiener filter with the 2-D Wiener filter. The proposed hybrid filter compares favorably against other techniques using an objective test.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

2-D Speech Enhancement based on Curvelet Transform using Different Window Functions

In this paper, an improved method based on Curvelet Transform using different window functions is presented for the speech enhancement. The window function is used for preprocessing of speech signals. In this method, instead of using two-dimensional (2-D) discrete Fourier Transform, Curvelet transform is employed with spectral magnitude subtraction method. General terms Spectral Substraction Me...

متن کامل

STFT-based speech enhancement by reconstructing the harmonics

A novel Short Time Fourier Transform (STFT) based speech enhancement method is introduced. This method enhances the magnitude spectrum of a noisy speech segment. The new idea that is used in this method is to basically reconstruct the harmonics at the multiples of the fundamental frequency ( 0 F ) rather than trying to improve them. The harmonics are produced, in the magnitude spectrum, using t...

متن کامل

Studies on Wavelet Based Linear Prediction Coefficients for Natural Sounding Speech using Different Windowing Techniques

This paper is aimed at to study LPC for Natural speech with different windowing techniques the effect of window shape on improving the speech quality by reducing the noise with the help of fixed and variable Windows with optimum shape. In the speech process signal corrupted by noise is segmented into frames and each segment is windowed using different Window with variation in the shape paramete...

متن کامل

Speech Enhancement Using Beta-order Mmse Spectral Amplitude Estimator with Laplacian Prior

This report addresses the problem of speech enhancement employing the Minimum Mean-Square Error (MMSE) of β-order Short Time Spectral Amplitude (STSA). We present an analytical solution for β-order MMSE estimator where Discrete Fourier Transform (DFT) coefficients of (clean) speech are modeled by Laplacian distributions. Using some approximations for the joint probability density function and t...

متن کامل

Speech Enhancement in the Dft Domain Using Laplacian Speech Priors

In this paper we consider optimal estimators for speech enhancement in the Discrete Fourier Transform (DFT) domain. We derive an analytical solution for estimating complex DFT coefficients in the MMSE sense when the clean speech DFT coefficients are Laplacian distributed and the DFT coefficients of the noise are Gaussian or Laplacian distributed. We show that these estimators have a number of i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEEE Trans. Speech and Audio Processing

دوره 11  شماره 

صفحات  -

تاریخ انتشار 2003